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SYSTEM AND METHOD FOR RECORDING VOICE AND THE DATA ENTERED BY .A CALL CENTER AGE 
NT AND RETRIEVAL OF THESE COMMUNICATION STREAMS FOR ANALYSIS OR CORRECTION 

The present invention relates to a system and method for 
5 analysing communication streams . 

Currently many commercial entities perform substantial 
amounts of their business via telephone or Internet contact 
with their customers. The analysis of such, contact can 

10 therefore help businesses to improve, the quality and 
efficiency of their services, and assist with customer 
retention and, ultimately, profitability. Attempts have 
been made previously to achieve such analysis in a 
satisfactory manner and to a satisfactory 'degree. For 

15 example, many businesses have, for some time, recorded some 
of their communi cat ions streams e.g. telephone calls between 
their staff and their customers. Traditionally this was 
done to satisfy regulatory requirements or to help resolve 
disputes. More recently, the emphasis has moved towards the 

20 reviewing of these interactions from a quality perspective. 
This is intended to identify good and bad aspects of 
particular calls with a view to improving the level of 
customer service given. Recently, for example, recording 
the activity on a PC screen has been undertaken to improve 

25 the completeness of the review procedure with the reviewer 
able to see how accurately staff are entering information 
received via the telephone. 

Also, it has been known to employ Call Detail Recording 
30 (CDR) systems to prevent perceived abuse of telephone 
systems and to apportion costs to the department or 
individual making the calls. 

Originally such records were printed out directly from the 
35 Private Automatic Branch Exchange (PABX) onto a line 
printer. Later, systems were designed to store this 
information in a database allowing more sophisticated 
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reporting and searching for calls on the basis of one or 
more of the stored call details. More recently, Computer 
Telephony Integration (CTI) interfaces have been provided 
that give this information in real-time, during the call. 

5 

Further, several systems currently exist that use call 
recording in combination with CDR or CTI and a database 
application in order to perform routine monitoring of calls 
with the intention of identifying weaknesses in individual 
10 Customer Service Representatives (CSRs) . Typically a small 
percentage of the CSR calls are reviewed and "scored" 
against a set of predetermined criteria to give an 
indication of the quality of that particular member of 
staff. 

15 

Within a call-centre environment, it should also be noted 
that rather than simply using standard PC office automation 
applications when dealing with customers, staff in most call 
centres use increasingly sophisticated applications that 
20 help them to handle the calls more efficiently and 
effectively. Helpdesk applications and telemarketing call 
scripting applications are examples of such applications. 
Most such applications share the following characteristics: 

(i) they guide the staff member through' the 
25 interaction, either explicitly suggesting questions to ask 

or implicitly e.g. by presenting a form with various fields 
to be filled in with details to be gleaned from the 
customer; and 

(ii) the progress of the call is, to some degree, 
30 dictated by the answers given during the call. 

As an example, a service call to a help desk may flow 
through the following stages: 

(i) take customer details and verify maintenance 
35 contract is up to date; 

(ii) take details of the problem being experienced; 

(iii) attempt to resolve using appropriate set of 
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diagnostic questions; 

(iv) if not solved, agree schedule for engineer to 

visit; and 

(v) give customer a problem reference number. 

5 

Within each of the above main stages there will be one or 
more sub-stages e.g. (i) (a) caller's name (i) (b) zip code 
etc . 

10 Current systems employed for CSR quality monitoring require 
the manual review of calls in order to determine a quality 
"score". This prohibits the review of more than a few 
percent of all calls and the accuracy of the human scoring 
process and its objectivity are at best questionable. When 

15 coupled with a small sample size of typically only a few 
calls per CSR per week, the resultant scores offer a limited 
degree of quality monitoring. Also there are only a few 
aspects of the call that allow for automated scoring. The 
most common such aspect is the call duration which, for a 

20 known telemarketing script, should be within a known range. 
. A portion' of the overall score (perhaps 5%) may be allocated 
to such an aspect of the call. 

Call centre management seeks to handle calls as efficiently 
25 as possible yet take advantage of cross -selling and up- 
selling opportunities whilst the customer is already on the 
line. To this end, the way in which calls are handled 
within the call centre is of critical importance to the 
overall efficiency and profitability of the operation. As 
30 the sophistication of call handling systems and processes 
increases/ so the complexity 'of the call flow designs 
increases such that it is very difficult to fully test and 
exercise these "scripts" prior to, or even during, their 
full-scale deployment. Sophisticated scripts may require a 
35 hundred pages or more of documentation including the "call 
flow diagram" and the associated questions and forms 
presented to the CSR. With many hundreds or even thousands 
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of possible paths through such a script, any one CSR will 
not have sufficient experience of many particular routes 
through the script to allow for the objective evaluation of 
the effectiveness or otherwise of, for example, the 
5 suggested way to handle a particular objection raised during 
a sale. 

Current systems typically store details of each call in a 
relation database typically with one record of more or less 

10 fixed structure per call. These details may include 
indications of the type of call, the "completion code", or 
whether a particular option was chosen or not etc. More 
advanced systems cater for the cases where a single "call 11 
actually contains more than one transaction (e.g. request a 

15 quotation for two different insurance policies) . This is 
typically done by subdividing the call into two or more 
parts each of which then has a database record associated 
with it . 

20 Existing systems use one or more database records per call 
and the identification of calls normally requires that the 
user determine in advance which aspects of the call's 
progress are to be stored. Timing information stored is 
often limited to total call duration. The relative- time 

25 between different points in the call is difficult to 
determine. Calls that "loop" through the same part of the 
script several times are not well described by a single 
database record. 

30 It has also been appreciated that customers and potential 
customers generally much prefer a "warm call" to a "cold 
call" i.e. a call where the caller obviously has taken the 
trouble to find out about a customer's previous dealings 
with their business rather than merely selecting the name 

35 from a list without any knowledge of the particular 
customer's details; many of which may already have been 
given previously. 
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Many businesses now proactively call their customers on a 
regular basis to try and "cross-sell" or "up-sell" with 
other products that they have determined may be of interest 
to the customer. 

5 

However, currently, where a follow-up call is to be made as 
the result of a previous call perhaps made several months 
earlier there is often little information available to the 
CSR responsible for the follow-up call. For example, a 

10 routine call to all mortgage customers may include the 
question "Have you reviewed your life insurance- cover 
recently"?. According to the answer given, the customer may 
be placed on a list for a follow up call the next time a new 
life insurance policy is launched. At the time of the 

15 original call, the CSR will likely do no more than tick a 
" Yes /No /Maybe " option . 

When the customer is called some months later, all that the 
■ new agent knows is that the customer previously said they 
20 may have been interested and the factual details presented 
about the customer's previous purchasing history. They do 
not know exactly how or in what tone of voice the customer 
responded several months earlier. 

25 Additionally, such systems are open to abuse by the CSRs as 
they may be rewarded, or at least measured, by the number of 
such follow-up leads they record. Hence there is a 
temptation for anything other than a point-blank "No" to be 
recorded as a "Maybe" . 

30 

It is also acknowledged that, in order to improve business 
processes, train CSRs and identify problems, there is a 
desire to identify individual calls or sets of calls which 
are handled particularly well or badly or that highlight 
35 deficiencies, opportunities, trends or anomalies. 

Existing systems limit the selection criteria to those that 
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can be derived solely from call details. * These use 
individual database records associated with each call to 
store data fields about each call. These are particularly 
inflexible and need to be designed prior to the query. For 

5 example, if a user identifies that they will want to search 
for calls where the "Help" key was pressed, they could tag 
each call with a binary flag and set this to true as the 
help key is pressed. A more sophisticated solution would be 
to use a time field and store the time at which the help key 

10 was pressed. However, if the user then wants to find calls 
where "Help" was pressed twice or more, this known previous 
database scheme proves inadequate. 

When considering call-replay, and irrespective of the reason 
15 for wanting to replay a specific call, there is often a 
desire to listen to a particular section of a call rather 
than the whole call. For example, an insurance claims 
handler will want to confirm that a driver now known to have 
a previous conviction disclosed this when asked during the 
20 initial policy quotation. Being able to jump directly to 
the portion of the call where the customer was asked this 
question avoids the need to browse through the whole call. 
Alternatively, a mail-order company CSR needing to correct 
an invalid address would like to jump to the part of the 
25 original call where the address was given. 

Current systems require users to either listen to the whole 
call or to jump around the call listening to snippets until 
they "home in" on the required section. This is time 
30 consuming and prone to error. 

Further it has long been possible to perform automated 
speech recognition in an attempt to transcribe or at least 
to highlight key words within telephone conversations. The 
35 capabilities of such systems (accuracy, tolerance of natural 
conversation speed, cost effectiveness) have been gradually 
improving during recent years. 



WO 01/52510 



PCT/GB01/00129 



7 

To obtain maximum accuracy, currently employed algorithms 
are very CPU and memory intensive. It is unlikely that many 
customers will be able to afford to deploy full recognition 
across all conversations they record. 

5 

Simply applying generic recognition to the whole of a 
conversation requires the recognition algorithm to infer the 
context and grammar from the conversation in order to refine 
its decisions as to which words it heard. This process is 
10 CPU intensive and less accurate than being explicitly told 
the current context of the conversation. 

It will therefore be appreciated that the current schemes of 
communication analysis are disadvantageously limited. 

15 

The present invention seeks to provide a system and method 
for analysing communication streams having advantages over 
known such systems and methods . 

20 According to one aspect of the present invention there is 
provided a system for communications recording and analysis 
including means for recording one or more communication 
streams, means for identifying the recorded streams, means 
for retrieving the content of said recordings- by identifier 

25 tags and wherein additional real-time information relating 
to the progress of the said communication streams is 
recorded. 

Preferably, the communication streams and associated 
30 progress streams are linked by means of a cross reference 
within respective call . detail records in a database of 
recordings . . 

In one embodiment, the communication streams can be 
35 interleaved with the call' flow recordings and a composite 
stream is recorded. 
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Advantageously, the progress information may be inferred 
from analysis, in real-time or later, of keystrokes entered 
at a computer/ terminal handling the interaction. 

5 Alternatively, the progress information may be inferred from 
analysis, in real-time or later, of computer mouse actions. 

Yet further, the progress information may be inferred from 
analysis, in real-time or later, of internet traffic 
10 emanating from, or terminating at, any of a number of 
computers/terminals handling the interaction. 

Additionally, the progress information may be inferred from 
analysis, in real-time or later, of the words and/or prosody 
15 spoken during the interaction. 

It should be appreciated that the contents of the call flow 
data stream may be used to refine the recording and/or 
analysis of the main stream (s) . 

20 

Preferably, retrieval of specific sets of recordings may be 
performed by selection from .the call detail records 
describing each of the recordings and/or the presence, 
repeated presence or absence of specified events or call 
25 states. 

Advantageously, the criteria may include relative or 
absolute timing information. 

30 Further, the presentation of, the content of individual 
recordings may be in the form of a graphical display 
including bars and/or pictures and wherein the location, 
colour and/or size of which may be determined by the 
recorded call flow recording. 

35 

In particular, the presentation of one or more call flow 
recordings may be in the form of a directed graph showing 
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the progress of the calls through the various states and 
transitions . 

Also, the position, colour, labelling, texture and other 
5 graphical attributes of the nodes and/or lines can be 
determined by the number of calls following that route; the 
time they spend in the various states and/or other 
attributes derivable from the call flow recordings. 

10 Still further, the display can be populated, and the 
graphical attributes modified, in real-time to reflect the 
timing information recorded within the call flow recordings. 

In particular, the speed of replay can be t varied, for 
15 example, including "paused", "forwarded" or "reversed" at 
any rate including "single -step 11 mode. 

Also, the set of call flow recordings may be refined by 
selecting, for example, via a double-click, a specific 
20 transition as discussed further later. 

Advantageously, specific nodes and/or transitions can be 
highlighted according to automatically or manually defined 
rules so as to draw attention to the presence or absence of 
25 particular call flow routes. 

The invention is advantageous in that the ability to track 
the progress of the call through the various stages allows 
more criteria to be subsequently scored automatically and 

30 hence objectively. Since computer- scoring can be easily and 
cost effectively achieved, it can be performed economically 
on a much larger sample of the calls - often the full 100%. 
For example, the time spent taking customer details should 
be within allowed ranges. Calls that did not result in the 

35 sale of product "X" should not have spent more than t 
seconds trying to sell this product before moving on to 
other propositions. Calls should not occur where the 
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"customer had to go back and re-enter details that should 
have been picked up earlier. Such a system allows perhaps 
20-30% of the overall quality "score" to be determined 
automatically and therefore objectively and across all 
5 calls. This gives much better statistical significance with 
less manpower. 

With regard to call flow design and call centre management, 
the invention can for example: 
10 present N examples of calls following a specific 

path through the call flow script e.g. a rarely 
exercised branch; and 

further filter these calls according to more 
traditional call detail records e.g. "Service 
15 calls for Model XYZ that ended without problem 

resolution" 

Having selected the set of calls of interest, these are then 
presented to the user and this can allow them to: 
20 review just a specified section of the call (e.g. 

answer to question 143) without having to dip into 
the whole call to determine the. appropriate part 
of the call; 

review the whole call so as to hear the required 
25 section in context; and 

see the timing information associated with the 
calls e.g. how long spent in the specified 
section; how long between start of sales pitch and 
this section etc. 

30 

Turning now to the personalisation of calls, the invention 
can advantageously allow the few seconds of response to the 
relevant question to be retrieved efficiently without the 
user having to listen to the rest of the call. Using this 
35 feature, two strategies can be adopted. First, all of the 
responses to the question can be reviewed by a dedicated 
individual or team prior to the campaign getting underway. 
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As they listen to all of the responses, they can (a) 
prioritise the customers from most to least likely to buy 
and hence choose which ones will eventually be called and 
(b) ensure that the script that is to be used when the 

5 campaign starts includes answers to the issues likely to be 
raised. Hence they can improve the effectiveness of the 
campaign and reduce the number of wasted calls that will be 
made. Note that feature (a) above requires all' responses to 
be listened to, whilst feature (b) only requires that a 

10 statistically significant sample be reviewed. 

Secondly, during the campaign, agents may be presented with 
the previous response either in audio form or in textual 
form (manually or automatically transcribed) . The former 

15 gives them the tonal and emotional content of the response 
but is less practical where autodiallers are used to present 
• calls to CSRs as soon as the customer answers. Either 
approach lets them speak to the customer from a position of 
advantage, knowing exactly how they reacted to the question 

20 when last asked. Presenting the previous response first 
would allow the CSR to determine whether it is worth placing 
the call or not. 

By recording the detailed progress of the call along with 
25 the call details and call content, the system advantageously 
allows the user to identify sets of "interesting" calls. 
Such calls might be those exhibiting the presence or absence 
of certain events within the call e.g. CSR requested "help" 
during the quotation process. This could identify CSR 
30 specific training needs or areas where all staff experience 
difficulty that could be improved. 

Alternatively, or in addition, where one product was 
purchased but details of others were also given, it might be 
35 worth calling the customer soon to try again to sell them 
these other products. 
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Also, where the CSR had to return to a previous part of the 
script, the reason for this can be considered- For example, 
was a question ambiguous, a mistake made or did the customer 
change their mind? 

"5 

Further, where less than 5 seconds was spent in the 
"confirmation" screen where customers should have the small 
print explained to them, this could suggest that the CSR was 
rushing on to the next call too fast . 

10 

Of course, poor typing skills might be indicated by 
excessive corrections and slow data entry. 

Efficient replay of calls, can advantageously be achieved by 
15 recording the detailed progress of the call along with the 
call details and call content so that the system allows the 
whole call to be shown on a Graphical User Interface (GUI) 
with graphical indications of the current state of the call 
and significant events within the call. The user can then 
20 choose which section (s) of the call to play by clicking on 
the appropriately marked points. Also, required segments of 
the call to be presented in isolation and played 
individually or one after the other. 

25 The present system, through monitoring and recording the 
progress of the call along with the audio content of the 
call, advantageously allows speech recognition algorithms to 
be directed at only those sections of conversations where 
maximum value is likely to be obtained. For example, no 

30 interpretation of the speech is undertaken whilst the 
customer's address is being taken. Likewise, interpretation 
can be concentrated on sections where sales objections are 
being handled or likely to be handled. Further, the 
algorithms can be directed in their selection of likely 

35 words according to the context of the conversation. For 
example, if a CSR has tabbed to the "Destination" field on 
screen, the customer is therefore more likely to have said 
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"Paris" than "Pairs". Also, the algorithm can be directed 
as to the likely speaker - whether CSR or customer. That 
is, where independent transmit and receive audio streams are 
not available, the progress through the call can be used to 
5 identify which speaker is most likely to be talking. 

The capabilities described above also allow the system to be 
used for the following additional and advantageous purposes. 
Usability analysis can determine how many keystrokes /mouse 

10 clicks were required to perform the most common functions 
and also identify the most common functions. Testing and 
Verification can also be achieved and used to find any calls 
where an order was taken that had not previously taken the 
customer through the required explanatory text thereby 

15 identifying an invalid path through the call flow script. 
CSR Quality Monitoring allows for the identification of 
those CSRs that spend more, or less, time than expected in 
specific sections of the call flow process or that follow 
certain paths through the call flow more or less often than 

20 the norm. 

The invention is described further hereinafter, by way of 
example only, with reference to the accompanying drawings in 
which: 

25 Fig. 1 is a block diagram of a recording and analysis 

system embodying the present invention; 

Fig. 2 is a functional block diagram of a recorder such 
as that in Fig. 1; 

Fig. 3 is a functional block diagram of an Application 
30 Programming Interface (API) for use within an embodiment of 
the present invention; 

Fig. 4 is an illustration of a graphical representation 
of the progress of a call; 

Fig. 5 is a flow diagram such as that displayed in 
35. accordance with »an embodiment of the present invention to 
illustrate call progress; and 

Fig. 6 is a further flow diagram derivable from the 
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diagram of Fig. 5. 

.Turning first to Fig. 1, one or more datastream recorders 4 
are connected to speech transmission circuits 2 which in 
5 turn are typically connected to a Private Automatic Branch 
Exchange (PABX) or similar telephony switching system 1. 
Connection is achieved by means of a high impedance tap 3 
which does not impact the normal use ..of the speech paths but 
also allows the recorder to monitor the signals on the 
10 paths. 

The recorders 4 are also connected to a local area network ' 
such as an ethernet (7) over which they communicate with 
other components of the system and can receive data for 
15 storage along with the voice data that they are recording 
from the speech paths (3) . This data can be provided by any 
application on the network such as central business 
applications running on servers 6 or on an end user's 
desktop 10 . 

20 

A server 5 consolidates details of all calls recorded by the 
recorders 4 which may be scattered across a local or wide 
area network and maintains a central database of "call 
records" which can be searched using standard SQL 
25 techniques. This also maintains details of the current 
location of removable media and the calls they contain. 

Also running on a server 5 as part of the recording system 
is typically an application, for example UNIFY, which 

30 interprets Computer Telephony Integration (CTI) data from 
the telephony switch 1. This information is used to control 
the recorders 4 and indicate when to start and stop 
recording on both the speech and data channels . This 
information is also used to "tag" the calls with information 

35 regarding the call, such as which extension 8 the call w£s 
directed to. The recording of the data streams can also be 
controlled by this application and these too can be tagged 
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with additional detail, such as which desktop 10 they relate 
to. Additionally, data stream recordings can be tagged with 
the identifier of the speech call that is in progress on the 
same desktop. For example, data from desktop 10 would be 
5 tagged with the identifier of the extension 8 on that 
desktop. 

The voice and data calls that have been recorded can be 
retrieved by applications running on PCs on the* network 11. 
10 These applications search the call-details database held on 
the server 5 to determine which call(s) they wish to replay 
and where these calls are currently held. The call content 
is then requested from the recorder holding the disk, tape 
or optical media on which it has been stored. 

15 

The following components of an overall system are currently 
available and are used as an underlying platform on which 
the specific enhancements used to provide the benefits 
described above can be deployed. 

20 

Multimedia Stream Recorders 

The recorder (4) is an example of a multi-channel voice 
•recorder capable* of storing up to 128 conversations 

25 simultaneously. Fig. 2 shows the relevant details of the 
recorder. The speech signals are presented via a high 
impedance tap 12 and are typically compressed by an 
appropriate line interface card 14. At regular intervals 
the blocks of data to be stored are passed via the internal 

30 data recording interface 16 for indexing and storage to hard 
disk 18, or optionally, Digital Audio Tape (DAT) media for 
longer term archive. Calls can be "tagged" with arbitrary 
data fields describing the call and subsequently retrieved 
for replay or analysis. In addition to recording voice 

35 calls, the recorder can record data streams representing 
arbitrary communication types. Examples include PC screen 
capture recordings, messages for display on underground 
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trains in-cab displays etc. These are typically presented 
as Internet Protocol (IP) packets via an ethernet cable 13 
connected to an appropriate Network Interface Card (NIC) 15. 
The contents of these packets are passed through the same 
5 internal storage API 16 for storage to disk. 

Computer Telephony Integration (O TT ) Control and Tagging. 

The UNIFY component mentioned above is used to interpret one 
10 or more data streams typically provided by telephony 
switches, Automatic Call Distributors (ACDs) or applications 
within a call centre. By passing the information flows from 
these and applying- customers defined rules, the UNIFY 
' component will control the recorders to start, stop, pause, 
15 resume or break recordings on specified voice and/or data 
channels and/or to "tag" recordings, current or past, with 
specified data. These data fields ultimately form part of 
that recording's call detail record and can later be used to 
search for it. 

20 

Fin t-.Rr prise -wide Recording and Re trieval Services 

E-Ware is an example of a suite of NT 4.0 applications which 
manage one or more recorders to provide, across the 
25 customer's local or wide area network, an enterprise-wide 
set of recording and retrieval services. Application 
Programming Interfaces (APIs) provide access to recording 
control, configuration, status monitoring, search and 
retrieval mechanisms. The system consolidates the call 

30 detail records from all the recorders in the system into a 
central, open relational database allowing queries of 
arbitrary complexity to be performed using SQL. 
Additionally, the system records the contents and current 
location of all removable media to which calls have been 

35 recorded. 
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One of the APIs of systems such as E-Ware provides for data 
recording capabilities and can be used by screen capture 
mechanisms. 

5 The generic data recording API 21 is implemented for example 
by Eware2DR.DLL on Windows platforms and allows applications 
19 to: 

register data streams as being available for 
storage; 

10 - start and stop recordings on such data streams 

with the data being transferred across the network 
23 via IP packets formed by an IP protocol st&ck 
22 and ultimately stored on the recorders (4) ; 
provide their own timing information within the 

15 data stream supplied or have the system 

automatically package the data inside a protocol 
that adds time information at the required level 
of accuracy and granularity. 

20 Replay Application 

Windows applications such as "Replay Studio" can be provided 
that allow a user to: 

specify search criteria and perform searches to 
25 select a required set of call details from the 

central database; 

view the results of these searches in tabular 
and/or graphical formats; 

retrieve the contents of selected calls from the 
30 recording system for delivery to a local or shared 

cache area for subsequent replay; and 
replay one or more selected calls in a way 
appropriate to the type of call. Examples are 
voice calls played via a soundcard and screen 
35 capture calls replayed in an on-screen window. 

The application consists of an ActiveX framework into which 
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additional data visualisation and replay mechanisms can be 
added to support new call types. 

Call Flow Recordings (CFRs) 

5 

The particular features related to this embodiment of the 
invention are as follows. The main purpose of these 
enhancements is to provide a Call Flow Recording (CFR) that 
details - to the level of detail required for a given 

10 application - the progress of a call through the system, 
this CFR is not a database record but is actually a 
recording of a real-time data stream that allows the 
progress of the call to be reconstructed including both the 
route it took through the call handling process 

15 potentially down to the individual key- strokes entered - and 
when each step occurred. 

These CFRs are stored within the generic recording system as 
"calls" of a new, well-known, Format type. This allows the 
20 retrieval and replay tools to recognise them and display 
them appropriately as opposed to trying to replay them as 
audio calls. The format identifier used is chosen to be in 
the range reserved for variable bit rate streams. 

25 These CFRs are tied to the other components of the call, 
such as voice recording and screen content record, by use of 
cross-reference fields within their call detail records. 
Each CFRs call detail record includes the globally unique 
reference number of the "parent 11 call - typically a voice 

30 recording - to which it refers . 

Additionally, "Parent" and "Child" flag fields can be used 
within the call detail records to alert applications to the 
fact that the voice call in question has one or more 
35 associated "calls" and, conversely, that the CFR has a 
related parent call and should not be viewed in isolation. 
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An alternative embodiment merges the context or progress 
information with the main recording so that the recorded 
data stream is an amalgam of the original plus the state 
information. A simple packetisation protocol allows the 

5 combined stream to be decoded into blocks of original and 
additional state information. This packetisation protocol 
may be further extended so as to include many different data 
streams within the single file. Any one or more of these 
streams may be extracted from the whole for subsequent 

10 analysis or replay. 

Ex plicit • Context Not ification 

Where applications involved in the progress of calls through. 
15 the call centre are required to explicitly advise the 
recording system of the call's current state, this can be 
achieved by using an API which is layered on top of the 
generic data recording API. This "Call Flow Recording. API" 
(CFR-API) provides the following functionality: 
20 - it advises that the call is entering stage N, and 

offers user definable parameters PI...Pn 
associated with this transition which are 
typically the name of the stage; 

it advises that the call is leaving stage N, and 
25 offers user definable parameters PI . ..Pn 

associated with this transition which are 
typically the name of the stage; and 
it advises that the event E that is occurring e.g. 
a sale being made, and offers user definable 
30 parameters PI ...Pn associated with this 

transition, for example the value of the sale) . 

It should be noted that several levels of detail can be 
stored since calls can enter multiple stages without leaving 
35 a higher level one. For example, a sales call may generate 
the following calls to the API as it is handled: 



WO 01/52510 



PCT/GB01/00129 



20 

entering Customer Identification Stage; 
entering Name Determination Stage; 

entering Surname Determination Stage; 
leaving Surname Determination Stage ; 
5 - entering FirstName Determination Stage; 

entering HelpScreen; 
leaving HelpScreen; 
leaving FirstName Determination Stage; 
leaving Name Determination Stage; 
10 - entering Address Determination Stage; and 

leaving Address Determination Stage . 

Tmplicit Context Determination 

15 In some cases it may be that not all of the applications 
involved in the handling of calls will be enhanced to 
provide the above-mentioned explicit notification of call 
progress. In such cases it may be necessary to infer the 
progress of the call from other sources such as screen 

20 capture, network (e.g. "web 1 ) traffic and/or speech analysis 
of the voice records . 

In such cases, these other data streams may be used as more 
or less satisfactory proxies for the missing or incomplete 
25 call flow recording. Three examples of how call progress 
can be inferred are given below. 

These inferred call flow elements may be analysed in real 
time and merged into the overall call flow recording or may 
30 be determined at a later date by retrieving the data 
recording from which they are to be extracted. 

First, the progress can be inferred from the key-strokes 
entered. As the entry of data via the keyboard is such a 
35 common and essential part of most call handling, keystroke's 
can be, by default, recorded as an integral ' part of the call 
flow recording for a given workstation. This is performed 
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using standard Microsoft Windows "hooks" that allow 
applications to intercept all keystrokes. The related data 
is then combined into the same recording as the explicit 
notifications. In effect, these provide the lowest level of 
5 granularity within the call flow recording, for example, 
appearing as : 

entering FirstName Determination Stage; 
KeyHit="Fl"; 
entering HelpScreen; 
10 - KeyHit="Esc" ; 

leaving HelpScreen; 
KeyHit="T"; 
KeyHit="o"; 
KeyHit="m" ; .and 
15 - leaving FirstName Determination Stage. 



The second means of inferring call progress relates to mouse 
clicks. As with keystrokes, mouse clicks and mouse 
movements can be sensed by default and inserted into the 
20 call flow recording. In addition to the basic action being 
performed, the co-ordinates on screen and as much detail as 
possible about the underlying control or application are 
recorded. 

25 Thirdly, although determining call flow by analysing the 
display on screen can prove quite complex and unreliable, in 
some cases it can prove advantageously simple and precise. 
For example, where screen capture is performed via 
Graphical-Device-Interface (GDI) call interception, an 

30 "Order Entry" stage may be defined as when focus is placed 
on the window with the caption "Order Processing System". 

Fourthly, whilst the discussions above have focused 
primarily on voice calls, the same principles and methods 
35 apply to visits by customers to a web site. The progress of 
the visitor around the web site and the analysis of the 
activities they are engaged in, such as browsing and 
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completing payment details, can be stored in exactly the 
same way as the progress of the voice call is tracked. In 
these cases, the web server may give explicit notification 
of call progress and/or this may be inferred from the web 
5 traffic being exchanged between the server and the visiting 
customer. 

Finally, where speech recognition software is deployed, 
either in real-time, or later on the retrieved recordings, 
10 the words that it recognises can be treated as a further, 
albeit "fuzzy", means of determining the progress of the 
interaction. For example, recognising "Visa" gives a good 
indication that payment details are under discussion. 

15 Real-time Control of Recording 

The architecture arising from the present invention allows 
for data streams to be recorded and/or acted upon to control 
the real-time actions of the system through the 
20 aforementioned UNIFY call control component. 

The call progress streams can therefore be treated as both 
a stream to be stored and a stream to be acted upon. Using 
the parsing mechanisms inherent within UNIFY, specific 
25 portions of the call that will start, stop, pause, resume, 
break or tag the recordings of the main communications 
streams can be identified. 

Search Mech anisms 

30 

Customizable forms are provided with which the user can 
specify the parameters and criteria that are to be used to 
select a set of calls for analysis and visualisation. The 
user may then choose to play the call(s) selected or to view 
35 their progress in.. any of the ways described below. 



The search specification mechanism allows the user to 
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specify: 



any required combination of call detail record 
parameters, for example call start time, campaign, 
wrap-up code. These can ultimately be expressed 
5 in, for example, an SQL "WHERE" clause; and 

one or more call flow parameters including one or 
more criteria of the following forms: 

calls passing through one or more specific 
point in the call flow diagram a specified 
10 number of times (including zero) ; and 

calls spending more or less than a specified 
time between specified points in the call 
flow diagram. 

15 As users refine their search criteria, it then becomes 
possible to apply the new criteria to: 
- . the set of all calls; 

the results of the previous search; and 
the results of any earlier search. 

20 

The search itself is performed as a combination of SQL query 
statements selecting the set of calls that satisfy the 
requested call detail parameters and a process whereby the 
call flow record of each of these calls can be retrieved and 
25 analysed to determine whether or not the call satisfies the 
call flow criteria specified. 



An optimisation of the above allows for more rapid responses 
to such searches by recording the results of commonly 
30 performed call flow record analyses and storing the results 
of these analyses in database tables which can then be 
searched efficiently using SQL queries when the system 
recognises one of these common search requests . 

35 Data Visualisation Methods 

Several different approaches can be used to show the call 
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flow. Each is appropriate to a particular form of analysis 
or reason for using the tools. For example, a call 
"storyboard" can be created. Traditionally, when a call was 
selected for replay, a simple "slider-bar" was displayed 

5 having a pointer moving from left to right as the call is 
replayed. More recently, the waveform of the recording has 
been illustrated and silent periods highlighted. Periods of 
interruption and, where screen capture has been performed, 
the volume of on-screen activity occurring over the duration 

10 of the call has also been included in the display. 

With the addition of a call flow recording,, the display 
illustrated in Fig. 4 can be further enhanced with coloured 
bars indicating the progress of . the call through various 
15 stages and sub-stages; pop-up text describing each stage 
should the user point at one of the bars; and icons 
signifying events during the call -including data entry. 

■ An example of these elements as would be overlaid on the 
20 voice and screen capture waveform displays is shown in Fig. 
4 with pop-up text explaining the bar being pointed to. The 
progress of the call is shown with the icons representing: 



Icon 


Meaning 


Handshake 


Customer greeting - standard 
salutation 


Bomb 


Customer raises complaint 


Dove 


Complaint resolved 
successfully 


Open hands 


New product offering explained 
and offered 


Gavel 


Price negotiations 


Tick 


Agreement to buy 


Money bag 


Payment terms agreed 
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Post Pigeon 



Delivery details agreed 



Graphical displays such as those shown in the top-level call 
flow diagram Fig. 5 are used to show the user the progress 
5 of one or more selected calls through the various stages of 
the call handling process. 

Specific features of such displays include nodes labelled to 
indicate the activity in progress, nodes coloured to 

10 highlight groupings (e.g. sales v service) and/or nodes 
shaped to distinguish between e.g. those with sub- structure 
within them that can be viewed by double -clicking on the 
node - as shown in "Basic Fault Finding" node, colours to 
highlight desirable/undesirable outcomes (e.g. abandoned 

15 call node) , lines between nodes to show the flow of calls 
from one stage to another, thickness of lines proportional 
to the number of calls following this path, lines being 
coloured to highlight unexpected routes as in the 
backtracking from delivery details to payment details 

20 discussed below, dashed lines used to highlight available 
but unused paths, and nodes sized to indicate the total time 
spent in this particular stage of the call flow. 

The example shown in Fig. G is a "drill-down" to finer 
25 detail within the "Basic Fault Finding" node of the previous 
diagram. In this example, the user has double clicked on 
the top level node to examine the detailed flow within the 
top-level node. This example shows by means of the dotted 
line, a possible call flow route that none of the selected 
30 calls took. In this case, the user might reconsider the 
value of the "Mains Lead" question since it did not resolve 
any of the selected calls . 



35 



Drill-down to Su bset of calls 

By double clicking on one of the lines, the user can select 
the sub-set of calls that went via that route. The other 
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lines in the diagram are then redrawn to show the rest of 
the call flow route taken by these particular calls. 

Dynamic C all Flow Diagrams 

5 

Such diagrams may be static, for example showing the overall 
call flow for the selected set of calls. Alternatively they 
may be dynamic, in which the lines are added and grow 
thicker in real time as calls are shown to move through the 
10 call flow at N times real-time. This allows users to see 
bottlenecks, delays and unusually sluggish or speedy calls. 

The speed at which the diagram populates can be adjusted to 
n times real-time where n may be 0 (paused) , or a positive 
15 value or negative i.e. "rewind" value. When paused, a 
"single-step" mode can be invoked in which each subsequent 
stage of the call is shown on demand or after a fixed time 
regardless of the actual time it took at the time of 
recording . 

20 

It should be appreciated that the invention is not 
restricted to the details of the foregoing embodiment, the 
particular features of which are merely illustrative. 

25 



30 



35 
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CLAIMS 

1. A communications recording and analysis system 
including means for recording one or more communication 

5 streams, means for identifying the recorded stream, means 
for retrieving the content of said recordings by identifier 
tags, and wherein additional real-time information relating 
to the progress of the said communication streams is 
recorded . 

10 

2. A system as claimed in Claim 1, wherein the 
communication streams and associated progress streams are 
linked by means of a cross reference within respective call 
detail records in a database of recordings. 

15 

3. A system as claimed in Claim 1 or 2, wherein 
communication streams are interleaved with call flow 
recordings and a composite stream is recorded. 

20 4. A system as claimed in Claim 1, 2 or 3 , and arranged 
such' that, the progress information is inferred from 
analysis, in real-time or later, of keystrokes entered at a 
computer/terminal handling the interaction. 

25 5. A system as claimed in Claim 1, 2 or 3 , and arranged 
such that the progress information is inferred from 
analysis, in real-time or later, of computer mouse actions. 

6. A system as claimed in Claim 1, 2 or 3 , and arranged 
30 such that the progress information is inferred from 
analysis, in real-time • or later, of internet traffic 
emanating from, or terminating at, any one or more of a 
number of computers/terminals handling the interaction. 

35 7. A system as claimed in Claim 1, 2 or 3, and arranged 
such that the progress information is inferred from 
analysis, in real-time or later, of the words and/or prosody 
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spoken during the interaction . 

8. A system as claimed in any one of Claims 1-7, and 
arranged for the retrieval of specific sets of recordings 

5 through selection from the call detail records describing 
each of the recordings and/or the presence, repeated 
presence or absence, of specified events or call states. 

9. A system as claimed in any one of Claims 1-8, and 
10 including graphical display means for the display of the 

content of individual recordings . 

10. A system as claimed in Claim 9, wherein the graphical 
display means is arranged such that the location, colour 

15 and/or size of display elements is determined by the 
recorded call flow recording. 

11. A system as claimed in Claim 9 or 10, wherein the 
graphical display means is arranged such that the 

20 presentation of one or more call flow recordings is in the 
form of a directed graph showing the progress of the calls 
through the various states and transitions. 

12. A system as claimed in Claim 9, 10 or 11, wherein the 
25 graphical display means is arranged to be populated, and the 

graphical attributes modified, in real-time to reflect the 
timing information recorded within the call flow recordings. 

13. A system as claimed in Claim 9, 10, 11 or 12, wherein 
30 specific elements of the display are arranged to be 

emphasized according to defined rules so as to draw 
attention to the presence or absence of particular call flow 
routes . 



35 
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